System and method for generating an advertisement effectiveness performance score

ABSTRACT

A system and method for generating an advertisement effectiveness performance score of a multimedia content element displayed in a webpage are provided. The method includes determining an advertisement context of an advertisement; analyzing metadata associated with at least one prior advertisement to determine a prior advertisement success score; determining a multimedia context of the multimedia content element displayed in the webpage; matching the advertisement context to the multimedia context to determine a context matching score; and generating an advertisement effectiveness score at least based on the prior advertisement success score and the context matching score.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 61/939,290 filed on Feb. 13, 2014. This application is a continuation-in-part (CIP) of U.S. patent application Ser. No. 13/770,603 filed on Feb. 19, 2013, now pending, which is a CIP of U.S. patent application Ser. No. 13/624,397 filed on Sep. 21, 2012, now pending. The Ser. No. 13/624,397 application is a CIP of:

(a) U.S. patent application Ser. No. 13/344,400 filed on Jan. 5, 2012, now pending, which is a continuation of U.S. patent application Ser. No. 12/434,221, filed May 1, 2009, now U.S. Pat. No. 8,112,376;

(b) U.S. patent application Ser. No. 12/195,863, filed Aug. 21, 2008, now U.S. Pat. No. 8,326,775, which claims priority under 35 USC 119 from Israeli Application No. 185414, filed on Aug. 21, 2007, and which is also a continuation-in-part of the below-referenced U.S. patent application Ser. No. 12/084,150; and,

(c) U.S. patent application Ser. No. 12/084,150 having a filing date of Apr. 7, 2009, now U.S. Pat. No. 8,655,801, which is the National Stage of International Application No. PCT/IL2006/001235, filed on Oct. 26, 2006, which claims foreign priority from Israeli Application No. 171577 filed on Oct. 26, 2005, and Israeli Application No. 173409 filed on Jan. 29, 2006.

All of the applications referenced above are herein incorporated by reference for all that they contain.

TECHNICAL FIELD

The present disclosure relates generally to the analysis of multimedia content displayed in a webpage, and more specifically to a system for generating an advertisement effectiveness performance score respective of multimedia content elements displayed in a webpage.

BACKGROUND

The Internet, also referred to as the worldwide web (WWW), has become a mass media content platform where the content presentation is largely supported by paid advertisements that are added to webpage content. Typically, advertisements are displayed using portions of code written in, for example, hyper-text mark-up language (HTML) or JavaScript that is inserted into, or otherwise called up by, documents also written in HTML and which are sent to a user node for display. A webpage typically contains text and multimedia elements that are intended for display on the user's display device.

One of the most common types of advertisements on the Internet is in a form of a banner advertisement. Banner advertisements are generally images or animations that are displayed within a webpage. Other advertisements are simply inserted at various locations within the display area of the document. A typical webpage displayed today is cluttered with many advertisements, which frequently are irrelevant to the content being displayed, and as a result the user's attention is not given to them. When each user's attention is not given to advertisements, the click-rate of these advertisements as well as the conversion rate decrease and, as a result, the advertising value of these advertisements decrease. Consequently, the advertising price of a potentially valuable display area may be low because its respective effectiveness is low.

In the context of advertising, many techniques are used to ensure that viewers of the advertisement are more likely to be paying attention to the advertisement. In particular, advertisers often provide advertisements to target audiences that are likely to be interested in the product and, therefore, will likely pay more attention to the advertisement than the average consumer. To this end, advertisers may seek to insert their advertisements into content (or on webpages containing such content) that they believe their target audiences will view and engage with.

Advertisers are more likely to believe such modification is worthwhile if, e.g., the advertisers know that certain types of advertisements are more likely to be viewed by a particular audience than for other audiences. For example, an audience viewing a blog with travel tips for vacation may be more inclined to pay attention to advertisements for plane tickets or services than the typical audience. With the advent of the Internet, new content can come out much more frequently than with content provided by other mediums such as television or radio. Such sites can potentially have hundreds, thousands, or even millions of items of content uploaded daily. Further, the number of potential views of such content can reach its peak in a very short amount of time. Consequently, quickly and efficiently determining effective placements for advertisements for web-based content has become increasingly important. As such, it would be useful for advertisers to be able to obtain information regarding the relation of the advertisement to webpages and content contained therein and, particularly, whether the advertisement and such content overlaps sufficiently that advertising would be worthwhile to the advertisers.

Existing solutions typically often require advertisers to perform significant market research to determine the most effective advertisements. Additionally, advertisers are often required to provide advertisements respective of hosts of webpages rather than based on specific content contained within webpages. However, advertisers would generally prefer to provide advertisements that are tailored or otherwise more related to the particular multimedia content presented on webpages.

It would therefore be advantageous to provide a solution that would determine advertisement effectiveness score of multimedia content elements when displayed with a reference to advertisements.

SUMMARY

A summary of several example embodiments of the disclosure follows. This summary is provided for the convenience of the reader to provide a basic understanding of such embodiments and does not wholly define the breadth of the disclosure. This summary is not an extensive overview of all contemplated embodiments, and is intended to neither identify key or critical elements of all aspects nor delineate the scope of any or all embodiments. Its sole purpose is to present some concepts of one or more embodiments in a simplified form as a prelude to the more detailed description that is presented later. For convenience, the term some embodiments may be used herein to refer to a single embodiment or multiple embodiments of the disclosure.

Certain embodiments include a method for generating an advertisement effectiveness performance score of a multimedia content element displayed in a webpage. The method comprises determining an advertisement context of an advertisement; analyzing metadata associated with at least one prior advertisement to determine a prior advertisement success score; determining a multimedia context of the multimedia content element displayed in the webpage; matching the advertisement context to the multimedia context to determine a context matching score; and generating an advertisement effectiveness score at least based on the prior advertisement success score and the context matching score.

Certain embodiments include a system for generating an advertisement effectiveness performance score of multimedia content elements displayed in a webpage. The method comprises a processing unit; and a memory, the memory containing instructions that when, executed by the processing unit, configured the system to: determine an advertisement context of an advertisement; analyze metadata associated with at least one prior advertisement to determine a prior advertisement success score; determine a multimedia context of the multimedia content element displayed in the webpage; match the advertisement context to the multimedia context to determine a context matching score; and generate an advertisement effectiveness score at least based on the prior advertisement success score and the context matching score.

BRIEF DESCRIPTION OF THE DRAWINGS

The subject matter disclosed herein is particularly pointed out and distinctly claimed in the claims at the conclusion of the specification. The foregoing and other objects, features, and advantages of the disclosed embodiments will be apparent from the following detailed description taken in conjunction with the accompanying drawings.

FIG. 1 is a schematic block diagram of a network system utilized to describe the various embodiments disclosed herein;

FIG. 2 is a flowchart describing the process of matching an advertisement to multimedia content displayed on a webpage;

FIG. 3 is a block diagram depicting the basic flow of information in the signature generator system;

FIG. 4 is a diagram showing the flow of patches generation, response vector generation, and signature generation in a large-scale speech-to-text system;

FIG. 5 is a flowchart describing the process of adding an overlay to multimedia content displayed on a webpage;

FIG. 6 is a flowchart describing a method for determining the context indicated by the relation between multimedia content elements displayed in a web page; and

FIG. 7 is a flowchart describing a method for generating an advertisement effectiveness performance score for multimedia content elements existing in a webpage according to an embodiment.

DETAILED DESCRIPTION

It is important to note that the embodiments disclosed herein are only examples of the many advantageous uses of the innovative teachings herein. In general, statements made in the specification of the present application do not necessarily limit any of the various claimed embodiments. Moreover, some statements may apply to some inventive features but not to others. In general, unless otherwise indicated, singular elements may be in plural and vice versa with no loss of generality. In the drawings, like numerals refer to like parts through several views.

Certain exemplary embodiments disclosed herein include a system and method for generating an advertisement effectiveness performance score to multimedia content elements embedded, for example, in a webpage. Accordingly, one or more multimedia content elements embedded in a webpage are analyzed and at least one signature is generated for such element. Then, the signatures are analyzed to determine the concept of each of the signatures and the context of each of the multimedia content elements respective thereto.

Metadata is also received respective of the multimedia content elements. The metadata includes at least one of: a number of clicks over the multimedia content elements, impressions, the content type of each multimedia content element, a combination thereof, and so on. The metadata can be used to identify the advertisement effectiveness of the multimedia content element(s). The metadata is analyzed respective of the context determined for multimedia content elements and an advertisement effectiveness performance score is generated for each multimedia content element respective of the analysis. The advertisement effectiveness performance scores may be stored in a data warehouse.

FIG. 1 shows an exemplary and non-limiting schematic diagram of a network system 100 utilized to describe the various embodiments disclosed herein. A network 110 is used to communicate between different parts of the system 100. The network 110 may be the Internet, the world-wide-web (WWW), a local area network (LAN), a wide area network (WAN), a metro area network (MAN), and other network capable of enabling communication between the elements of the system 100.

Further connected to the network 110 are one or more client applications, such as web browsers (WB) 120-1 through 120-n (hereinafter referred to collectively as web browsers 120 or individually as a web browser 120). A web browser 120 is executed over a computing device such as, for example, a personal computer (PC), a personal digital assistant (PDA), a mobile phone, a smart phone, a tablet computer, a wearable computing device, and other kinds of wired and mobile appliances, equipped with browsing, viewing, listening, filtering, and managing capabilities, etc., that are enabled as further discussed herein below.

The system 100 also includes a plurality of information sources 150-1 through 150-m (hereinafter referred to collectively as information sources 150 or individually as an information source 150) connected to the network 110. Each of the information sources 150 may be, for example, a web server, an application server, a publisher server, an ad-serving system, a data repository, a database, and the like. Also connected to the network 110 is a data warehouse 160 configured to store multimedia content elements, clusters of multimedia content elements, and the context determined for a webpage as identified by its URL. In the embodiment illustrated in FIG. 1, a context server 130 is configured to communicate with the data warehouse 160 through the network 110. In other non-limiting configurations, the context sever 130 is directly connected to the data warehouse 160.

The various embodiments disclosed herein are realized using the context server 130 and a signature generator system (SGS) 140. The SGS 140 may be connected to the context server 130 directly (as shown in FIG. 1) or through the network 110 (not shown). The context server 130 is enabled to receive and serve multimedia content elements and causes the SGS 140 to generate a signature respective of the multimedia content elements. The process for generating signatures for multimedia content is explained in more detail herein below with respect to FIGS. 3 and 4. It should be noted that each of the context server 130 and the SGS 140 typically comprises a processing unit (e.g., the processing units 131 and 141, respectively), such as a processor, that is coupled to a memory (e.g., the memories 132 and 142, respectively). The memory 132 and the memory 142 contain instructions that can be executed by the processing unit 131 and the processing unit 141, respectively. The context server 130 also includes an interface to the network 110.

Each processing unit may comprise, or be a component of, a larger processing unit implemented with one or more processors. The one or more processors may be implemented with any combination of general-purpose microprocessors, microcontrollers, digital signal processors (DSPs), field programmable gate array (FPGAs), programmable logic devices (PLDs), controllers, state machines, gated logic, discrete hardware components, dedicated hardware finite state machines, or any other suitable entities that can perform calculations or other manipulations of information.

Each processing unit may also include machine-readable media for storing software. Software shall be construed broadly to mean any type of instructions, whether referred to as software, firmware, middleware, microcode, hardware description language, or otherwise. Instructions may include code (e.g., in source code format, binary code format, executable code format, or any other suitable format of code). The instructions, when executed by the processing unit, cause the processing unit to perform the various functions described herein. In another embodiment, the processing unit may be realized as an array of computational cores.

According to the disclosed embodiments, the context server 130 is configured to receive at least a URL of a webpage hosted in an information source 150 and accessed by a web browser 120. The context server 130 is further configured to analyze the multimedia content elements contained in the webpage to determine their context, thereby ascertaining the context of the webpage. The analysis is performed based on at least one signature generated for each multimedia content element. It should be noted that the context of an individual multimedia content element or a group of multimedia content elements may be extracted from the webpage, received from a user of a web browser 120 (e.g., uploaded video clip), or retrieved from the data warehouse 160.

According to the embodiments disclosed herein, a user visits a webpage using a web browser 120. When the webpage is uploaded on the user's web browser 120, a request is sent to the context server 130 to analyze the multimedia content elements contained in the webpage. The request to analyze the multimedia content elements, and therefore the advertising effectiveness performance score, can be generated and sent by a script executed in the webpage, an agent installed in the web browser, or by one of the information sources 150 (e.g., a web server or a publisher server) when requested to upload one or more advertisements to the webpage. The request to analyze the multimedia content may include a URL of the webpage or a copy of the webpage. In one embodiment, the request may include multimedia content elements extracted from the webpage. A multimedia content element may include, for example, an image, a graphic, a video stream, a video clip, an audio stream, an audio clip, a video frame, a photograph, and an image of signals (e.g., spectrograms, phasograms, scalograms, etc.), and/or combinations thereof and portions thereof.

The context server 130 is configured to analyze the multimedia content elements in the webpage to determine their context. For example, if the webpage contains images of palm trees, a beach, and the coast line of San Diego, the context of the webpage may be determined to be “California sea shore.” According to one embodiment, the determined context can be utilized to detect one or more matching advertisements for the multimedia content elements. According to this embodiment, the SGS 140 is configured to generate at least one signature for each multimedia content element provided by the context server 130. The generated signature(s) may be robust to noise and distortion as discussed further herein below with respect to FIGS. 3 and 4.

Using the generated signature(s), the context server 130 is configured to determine the context of the elements and search the data warehouse 160 for a matching advertisement based on the context. For example, if the signature of an image indicates a “California sea shore”, then an advertisement for a swimsuit, a beach towel, or a California vacation can be potential matching advertisements.

It should be noted that using signatures for determining the context and for the searching of advertisements ensures more accurate reorganization of multimedia content than, for example, when using metadata. For instance, in order to provide a matching advertisement for a sports car, it may be desirable to locate a car of a particular model. However, in most cases the model of the car would not be part of the metadata associated with the multimedia content (an image). Moreover, the car shown in an image may be at angles different from the angles of a specific photograph of the car that is available as a search item. The signature generated for that image would enable accurate recognition of the model of the car because the signatures generated for the multimedia content elements, according to the disclosed embodiments, allow for recognition and classification of multimedia content elements such as, but not limited to, content-tracking, video filtering, multimedia taxonomy generation, video fingerprinting, speech-to-text, audio classification, element recognition, video/image search, and any other application requiring content-based signatures generation and matching for large content volumes such as, web and other large-scale databases.

In one embodiment, the signatures generated for more than one multimedia content element are clustered. The clustered signatures are used to determine the context of the webpage and to search for a matching advertisement. The one or more selected matching advertisements are retrieved from the data warehouse 160 and uploaded to the webpage on the web browser 120.

FIG. 2 depicts an exemplary and non-limiting flowchart 200 describing the process of matching an advertisement to multimedia content displayed on a webpage. In S205, a webpage is uploaded to a web browser (e.g., the web browser 120-1). In S210, a request to match at least one multimedia content element contained in the uploaded webpage to an appropriate advertisement is received. The request can be received from a publisher server, a script running on the uploaded webpage, or an agent (e.g., an add-on) installed in the web browser. S210 can also include extracting the multimedia content elements for a signature that should be generated.

In S220, at least one signature is generated for the multimedia content element executed from the webpage. The signature for the multimedia content element is generated by a signature generator (e.g., the SGS 140) as described further herein below with respect to FIGS. 3 and 4. In an embodiment, based on the generated signatures, the context of the extracted multimedia content elements, and thus the context of the webpage, is determined as described further herein below with respect to FIG. 6.

In S230, an advertisement item is matched to the multimedia content element respective of its generated signatures and/or the determined context. According to one embodiment, the matching process includes searching for at least one advertisement based on the signature of the multimedia content and a display of the at least one advertisement item within the display area of the webpage. According to another embodiment, the signatures generated for the multimedia content elements are clustered and the cluster of signatures is matched to one or more advertisements. According to yet another embodiment, the matching of an advertisement to a multimedia content element can be performed by computational cores that are part of a large scale matching system as discussed in further detail herein below with respect to FIGS. 3 and 4.

In S240, upon identification of a user gesture, the matching advertisement is uploaded to the webpage and displayed therein. The user gesture may be, but is not limited to, a scroll on the multimedia content element, a press on the multimedia content element, and/or a response to the multimedia content. Uploading the matching advertisement only upon identifying a user gesture ensures that the user's attention is given to the advertised content. In S250, it is determined whether there are additional requests to analyze multimedia content elements, and if so, execution continues with S210; otherwise, execution terminates.

As a non-limiting example, an image that contains a plurality of multimedia content elements is identified in an uploaded webpage. At least one signature is generated for each multimedia content element executed in the webpage. According to this embodiment, a printer and a scanner are shown in the image, and signatures are generated respective thereto. The context of the image is determined to be “office equipment.” Therefore, at least one advertisement that is suitable for users viewing content related to office equipment is matched to the multimedia content element.

FIGS. 3 and 4 illustrate the generation of signatures for the multimedia content elements according to one embodiment. An exemplary high-level description of the process for large scale matching is depicted in FIG. 3. In this example, the matching is for video content.

Video content segments 2 from a Master database (DB) 6 and a Target DB 1 are processed in parallel by a large number of independent computational Cores 3 that constitute an architecture for generating the Signatures (hereinafter the “Architecture”). Further details on the computational Cores generation are provided below. The independent Cores 3 generate a database of Robust Signatures and Signatures 4 for Target content-segments 5 and a database of Robust Signatures and Signatures 7 for Master content-segments 8. An exemplary and non-limiting process of signature generation for an audio component is shown in detail in FIG. 4. Finally, Target Robust Signatures and/or Signatures are effectively matched, by a matching algorithm 9, to Master Robust Signatures and/or Signatures database to find all matches between the two databases.

To demonstrate an example of the signature generation process, it is assumed, merely for the sake of simplicity and without limitation on the generality of the disclosed embodiments, that the signatures are based on a single frame, leading to certain simplification of the computational cores generation. The Matching System is extensible for signatures generation capturing the dynamics in-between the frames.

The Signatures' generation process is now described with reference to FIG. 4. The first step in the process of signatures generation from a given speech-segment is to breakdown the speech-segment to K patches 14 of random length P and random position within the speech segment 12. The breakdown is performed by the patch generator component 21. The value of the number of patches K, random length P and random position parameters is determined based on optimization, considering the tradeoff between accuracy rate and the number of fast matches required in the flow process of the context server 130 and SGS 140. Thereafter, all the K patches are injected in parallel into all computational Cores 3 to generate K response vectors 22, which are fed into a signature generator system 23 to produce a database of Robust Signatures and Signatures 4.

In order to generate Robust Signatures, i.e., Signatures that are robust to additive noise L (where L is an integer equal to or greater than 1) by the Computational Cores 3 a frame ‘i’ is injected into all the Cores 3. Then, Cores 3 generate two binary response vectors: {right arrow over (S)} which is a Signature vector, and {right arrow over (RS)} which is a Robust Signature vector.

For generation of signatures robust to additive noise, such as White-Gaussian-Noise, scratch, etc., but not robust to distortions, such as crop, shift and rotation, etc., a core Ci={ni} (1≤i≤L) may consist of a single leaky integrate-to-threshold unit (LTU) node or more nodes. The node ni equations are:

$V_{i} = {\sum\limits_{j}\;{w_{ij}k_{j}}}$ n_(i) = Π(Vi − Th_(x))

where,

is a Heaviside step function; w_(ij) is a coupling node unit (CNU) between node i and image component j (for example, grayscale value of a certain pixel j); k_(j) is an image component ‘j’ (for example, grayscale value of a certain pixel j); Thx is a constant Threshold value, where ‘x’ is ‘S’ for Signature and ‘RS’ for Robust Signature; and Vi is a Coupling Node Value.

The Threshold values Thx are set differently for Signature generation and for Robust Signature generation. For example, for a certain distribution of Vi values (for the set of nodes), the thresholds for Signature (Th_(S)) and Robust Signature (Th_(RS)) are set apart, after optimization, according to at least one or more of the following criteria:

-   -   1: For: V_(i)>Th_(RS)         -   1−p(V>Th_(S))−1−(1−ε)^(l)<<1             i.e., given that l nodes (cores) constitute a Robust             Signature of a certain image I, the probability that not all             of these I nodes will belong to the Signature of same, but             noisy image, Ĩ is sufficiently low (according to a system's             specified accuracy).     -   2: p(V_(i)>Th_(RS))≈l/L         i.e., approximately l out of the total L nodes can be found to         generate a Robust Signature according to the above definition.     -   3: Both Robust Signature and Signature are generated for certain         frame i.

It should be understood that the generation of a signature is unidirectional, and typically yields lossless compression, where the characteristics of the compressed data are maintained but the uncompressed data cannot be reconstructed. Therefore, a signature can be used for the purpose of comparison to another signature without the need of comparison to the original data. The detailed description of the Signature generation can be found in U.S. Pat. Nos. 8,326,775 and 8,312,031, assigned to common assignee, which are hereby incorporated by reference for all the useful information they contain.

A Computational Core generation is a process of definition, selection, and tuning of the parameters of the cores for a certain realization in a specific system and application. The process is based on several design considerations, such as:

(a) The Cores should be designed so as to obtain maximal independence, i.e., the projection from a signal space should generate a maximal pair-wise distance between any two cores' projections into a high-dimensional space.

(b) The Cores should be optimally designed for the type of signals, i.e., the Cores should be maximally sensitive to the spatio-temporal structure of the injected signal, for example, and in particular, sensitive to local correlations in time and space. Thus, in some cases a core represents a dynamic system, such as in state space, phase space, edge of chaos, etc., which is uniquely used herein to exploit their maximal computational power.

(c) The Cores should be optimally designed with regard to invariance to a set of signal distortions, of interest in relevant applications.

A detailed description of the Computational Core generation and the process for configuring such cores is discussed in more detail in U.S. Pat. No. 8,655,801 referenced above.

FIG. 5 depicts an exemplary and non-limiting flowchart 500 describing the process of adding an overlay to a multimedia content element displayed on a webpage. In S510, a webpage is uploaded to a web browser (e.g., the web browser 120-1). In another embodiment, the method starts when a web server (e.g., the web server 150-1) receives a request to host the requested webpage.

In S515, a uniform resource locator (URL) of the uploaded webpage is received. In an embodiment, the URL is received by a context server (e.g., the context server 130). In a further embodiment, the uploaded webpage includes an embedded script. In that embodiment, the embedded script extracts the URL of the webpage and sends the URL to the context server. In another embodiment, an add-on installed in the web browser extracts the URL of the uploaded webpage and sends the URL to the context server.

In yet another embodiment, an agent is installed on a user device executing the web browser. The agent is configured to monitor webpages uploaded to the website, extract the URLs, and send the extracted URLs to the context server. In another embodiment, a web server (e.g., the web server 150) hosting the requested webpage provides the context server with the URL of the requested webpage. It should be noted that only URLs of selected websites can be sent to the context server, for example, URLs related to websites that paid for the additional information.

In S520, the webpage is downloaded respective of each received URL. In an embodiment, the webpage is downloaded by the context server. In S525, the webpage is analyzed in order to identify the existence of one or more multimedia content elements in the webpage. It should be understood that a multimedia content element, such as an image or a video, may include a plurality of multimedia content elements. In S530, at least one signature is generated for each identified multimedia content element. In an embodiment, the signatures are generated by an SGS (e.g., the SGS 140). The signatures for the multimedia content elements are generated as described in greater detail above with respect to FIGS. 3 and 4.

In S535, respective of the generated signatures, the context of each multimedia content element is determined. In an embodiment, the context is determined by the context server. The determination of the context based on the signatures is discussed in more detail herein below with respect to FIG. 6. In S540, respective of the context of each multimedia content element, at least one link to content that exists on a web server is determined. A link may be, but is not limited to, a hyperlink, a URL, and so on.

The content accessed through the link may be, for example, informative webpages such as the Wikipedia® website. The determination of the link may be made by identification of the context of the generated signatures. As an example, if the context of the multimedia content elements was identified as a particular football player, then a link to a sports website that contains information about the football player is determined.

In S550, the determined link is added as an overlay to the webpage respective of the corresponding multimedia content element. According to one embodiment, a link that contains the overlay may be provided to a web browser (e.g., browser 120-1) respective of a user's gesture. A user's gesture may be, but is not limited to, a scroll on the multimedia content element, a click on the multimedia content element, and/or a response to the multimedia content or a portion thereof.

The modified webpage that includes the multimedia content element with the added link can be sent directly to a web browser that is requesting the webpage. This direct sending requires establishing a data session between the context server and the web browser. In another embodiment, the multimedia content element including the added link is returned to a web server (e.g., e.g., the information source 150) hosting the requested webpage. The web server returns the requested webpage with the multimedia element containing the added link to the web browser requesting the webpage. Once the “modified” webpage is displayed over the web browser, a detection of a user gesture over the multimedia element would prompt the web browser to upload the content accessed by the link added to the multimedia element.

In S560, it is checked whether the one or more multimedia content elements contained in the webpage has changed and, if so, execution continues with S525; otherwise, execution terminates.

As a non-limiting example, a webpage containing an image of the movie “Pretty Woman” is uploaded to a context server. A signature is generated by a SGS respective of the actor Richard Gere and the actress Julia Roberts, both shown in the image. The context of the signatures according to this example may be “American Movie Actors”. An overlay containing links to Richard Gere's biography and Julia Roberts' biography on the Wikipedia® website is added over the image such that upon detection of a user's gesture, for example, a mouse clicking over the part of the image where Richard Gere is shown, the link to Richard Gere's biography on Wikipedia® is provided to the user of the mouse.

According to another embodiment, a webpage that contains an embedded video clip and a banner advertising New York City is requested by a web browser from an information source. A context server receives the requested URL. The context server analyzes the video content and the banner within the requested webpage, and a signature is generated by a SGS respective of the entertainer Madonna that is shown in the video content and the banner. The context of multimedia content embedded in the webpage is determined to be “live pop shows in NYC.” In response to the determined context, a link to a hosted website for purchasing show tickets is added as an overlay to the video clip. The webpage together with the added link is sent to a web server (e.g., the information source 150-1), which then uploads the requested webpage with the modified video element to the web browser.

The webpage may contain a number of multimedia content elements; however, in some instances only a few links may be displayed in the webpage. Accordingly, in one embodiment, the signatures generated for the multimedia content elements are clustered and the cluster of signatures is matched to one or more advertisement items.

FIG. 6 illustrates a method for determining a context of a multimedia content element. In S610, a webpage is uploaded to a web browser (e.g., the web browser). In another embodiment, the method starts when a request to host the requested webpage is received by a web server (e.g., web browser 150-1).

In S620, the uniform resource locator (URL) of the webpage to be processed is received. In an embodiment, the URL is received by a context server (e.g., the context server 130). In another embodiment, the uploaded webpage includes an embedded script. The embedded script extracts the URL of the webpage and sends the URL to the context server 130. In another embodiment, an add-on installed in the web browser extracts the URL of the uploaded webpage and sends the URL to the context server.

In yet another embodiment, an agent is installed on a user device executing the web browser. The agent is configured to monitor webpages uploaded to the website, extract the URLs, and send the URLs to the context server. In another embodiment, a web server (e.g., the information source 150-1) hosting the requested webpage provides the context server with the URL of the requested webpage. It should be noted that only URLs of selected websites can be sent to the context server, for example, URLs related to websites that paid for the additional information.

In S630, the webpage respective of each received URL is downloaded. In an embodiment, the webpage is downloaded by a context server (e.g., the context server 130). In S640, the webpage is analyzed in order to identify the existence of one or more multimedia content elements in the uploaded webpage. In an embodiment, the analysis is performed by the context server 130. Each identified multimedia content element is extracted from the webpage and sent to the SGS 140. In S650, for each multimedia content element identified by the context server, at least one signature is generated. In an embodiment, the signatures are generated by an SGS (e.g., the SGS 140). The at least one signature is robust for noise and distortion. The signatures for the multimedia content elements are generated as described in greater detail above in FIGS. 3 and 4. It should also be noted that signatures can be generated for portions of a multimedia content element.

In S660, the signatures are analyzed to determine the correlation among the signatures of all extracted multimedia content elements, or portions thereof. In an embodiment, the signatures are analyzed by the context server. Specifically, each signature may be associated with a different concept. The signatures are analyzed to determine the correlation between concepts. A concept is an abstract description of the content to which the signature was generated. For example, a concept of the signature generated for a picture showing a bouquet of red roses may be “flowers.” The correlation between concepts can be achieved by identifying a ratio between signatures' sizes, a spatial location of each signature, and so on using probabilistic models. As noted above, a signature represents a concept and is generated for a multimedia content element. Thus, identifying, for example, the ratio of signatures' sizes may also indicate the ratio between the size of their respective multimedia elements.

A context is determined as the correlation among a plurality of concepts. A strong context is determined when an amount of concepts above a threshold satisfy the same predefined condition. The threshold may be predetermined. As an example, signatures generated for multimedia content elements of a smiling child with a Ferris wheel in the background are analyzed. The concept of the signature associated with the image of the smiling child is “amusement” and the concept of a signature associated with the image of the Ferris wheel is “amusement park.” The relation between the signatures of the child and recognized wheel is further analyzed to determine that the Ferris wheel is bigger than the child. The relation analysis determines that the Ferris wheel is used to entertain the child. Therefore, the determined context may be “amusement.”

According to one embodiment, one or more typically probabilistic models may be used to determine the correlation between signatures representing concepts. The probabilistic models determine, for example, the probability that a signature may appear in the same orientation and in the same ratio as another signature. When performing the analysis, information maintained in a database (e.g., the data warehouse 160) may be used including, for example, previously analyzed signatures. In S670, based on the analysis performed in S660, the context of a plurality of multimedia content elements existing in the webpage and in the context of the webpage are determined. In an embodiment, the context is determined by the context server.

As an example, an image that contains a plurality of multimedia content elements is identified in an uploaded webpage. At least one signature is generated for each multimedia content element of the plurality of multimedia content elements that exist in the image. According to this example, multimedia content elements of the singer “Adele”, the “red carpet,” and a “Grammy” award are shown in the image. Signatures are generated respective thereto. The context server 130 analyzes the correlation between “Adele,” the “red carpet,” and a “Grammy” award, and determines the context of the image based on the correlation. According to this example, such a context may be “Adele Winning a Grammy Award”.

As another example, a webpage containing a plurality of multimedia content elements is identified in an uploaded webpage. According to this example, signatures are generated for the objects “glass,” “cutlery,” and “plate,” which appear in the multimedia elements. The correlation between the concepts generated by signatures is analyzed respective of the data maintained in a database such as, for example, analysis of previously generated signatures. According to this example, because all of the concepts of the “glass”, the “cutlery”, and the “plate” satisfy the same predefined condition, a strong context is determined. The context of such concepts may be a “table set”. The context can be also determined respective of a ratio of the sizes of the objects (glass, cutlery, and plate) in the image and the distinction of their spatial orientation.

In S680, the context of the multimedia content together with the respective signatures is stored in a database (e.g., the data warehouse 160) for future use. In S690, it is checked whether there are additional webpages and, if so, execution continues with S610; otherwise, execution terminates.

FIG. 7 is an exemplary and non-limiting flowchart 700 illustrating a method for generating an advertisement effectiveness score according to an embodiment. In S710, a request to generate an advertisement effectiveness score for at least one multimedia content element is received. The request may include, for example, a URL of a webpage containing the multimedia content element and an advertisement. The advertisement is likely to be displayed with respect to a multimedia content element and/or in the webpage. For example, the advertisement can be displayed over the multimedia content element. It should be noted that the advertisement is an online advertisement which may be in a form of multimedia content.

In an embodiment, the URL is received by a context server (e.g., the context server 130). In another embodiment, the webpage includes an embedded script (e.g., a JavaScript). The script may be programmed to extract the URL of the webpage and/or the multimedia content element, and send the URL and/or the multimedia content element to the context server. In another embodiment, an add-on installed in a web browser (e.g., the web browser 120) extracts the URL of the uploaded webpage and sends the URL to the context server.

In S720, a context of the advertisement is determined. In an embodiment, S720 includes generating at least one signature for the advertisement and determining matching context(s) based on the at least one generated signature(s) as described further herein above with respect to FIG. 6.

In S730, metadata associated with advertisements that were previously displayed with respect to (e.g., as an overlaid on) the multimedia content element contained in the webpage is analyzed. The received metadata may be, but is not limited to, a number of interactions of users with an advertisement, a duration of advertisement overlay, a type of interaction of users with an advertisement, and so on. The received metadata is related to information that is useful in determining an effectiveness of the previous advertisements with respect to the multimedia content element contained in the webpage.

In a further embodiment, the only metadata associated with prior advertisements that have the same or a similar context with the advertisement are analyzed. Two contexts may be similar or the same if, for example, signatures representing the contexts demonstrate matching above a predefined threshold. Signature matching is described further herein above with respect to FIGS. 3 and 4. As an example, if the advertisement to be overlaid has a context of “vacuum cleaners,” then only metadata demonstrating the effectiveness of other advertisements related to vacuum cleaners or other cleaning products may be received.

In an embodiment, the analysis yields a prior advertisement success score. The prior advertisement success score may be, but is not limited, to a numerical value, e.g., a value between 1 and 10, where a value of 1 represents low success and a value of 10 represents high success. In a further embodiment, the prior advertisement success score may be decreased if the contexts of the prior advertisements do not match the context of the advertisement to be overlaid above a predefined threshold.

As a non-limiting example of determining prior advertisement success scores, if previous advertisements that have the same or similar context as the advertisement to be overlaid were only displayed for 5 seconds but yielded 7,000 clicks in 10,000 views, the prior advertisement success score may be determined to be 9 on a scale from 1 to 10. As another non-limiting example, if previous advertisements that have the same or similar context as the advertisement to be overlaid were only displayed for the entire 2 minute video and yielded 7,000 clicks in 10,000 views, the prior advertisement success score may be determined to be 7 on a scale from 1 to 10.

In S740, a context of the multimedia content element contained in the webpage is determined. Determining contexts of multimedia content elements contained in webpages is described further herein above with respect to FIG. 6.

In S750, the context of the multimedia content element contained in the webpage and the context of the advertisement are matched. In an embodiment, this matching may involve conducting signature matching between signatures that represent the context of the multimedia content element and signatures that represent the context of the advertisement. Signature matching is described further herein above with respect to FIGS. 3 and 4. This context matching yields a context matching score. The context matching score is typically a numerical value such as, but not limited to, a value between 1 and 10, where a value of 1 represents low matching and a value of 10 represents high matching.

In S760, an advertisement effectiveness score is generated based on the prior advertisement success score and the context matching score. The advertisement effectiveness score is a value representing the likelihood that the advertisement will capture a particular user's attention. The advertisement effectiveness score may be, but is not limited to, a numerical value (e.g., 1 through 10), a grade value (e.g., F through A, with F being the lowest grade and A being the highest grade), a categorical value (e.g., one of poor, moderate, and good), and so on.

The advertisement effectiveness score may be generated by, but is not limited to, adding the prior advertisement success score and the context matching score, multiplying the prior advertisement success score by the context matching score, and performing other operations on the prior advertisement success score and the context matching score.

In various embodiments, the prior advertisement success score and the context matching score may be assigned weighted values that affect the influence of each score in determining the advertisement effectiveness score. In an embodiment, if no prior advertisements have been overlaid on the multimedia content element contained in the webpage, the prior advertisement success score may be assigned a weight of 0. In an embodiment, once a numerical value for the advertisement effectiveness score is determined, the numerical value may then be converted to, e.g., a grade value or a categorical value. As an example, a numerical advertisement effectiveness score of 9 may be converted into a grade value of “A” or a categorical value of “good.”

In another embodiment, if the advertisement effectiveness score is determined to be above a predefined threshold, the advertisement may be provided to a user device be overlaid on the multimedia content element displayed on the webpage. This embodiment allows advertisers to choose to have appropriate advertisements automatically sent to users while ensuring that the sent advertisements are more likely to garner significant user attention.

As a non-limiting example of determining an advertisement effectiveness score based on the analysis of the metadata, two images of the same refrigerator are shown in an electronic-commerce webpage. Metadata received respective of both of the images indicates that more users interact with one image of the refrigerator than the other. Therefore, the advertisement effectiveness of the first image is determined as higher than the advertisement effectiveness of the second image.

The various embodiments disclosed herein can be implemented as hardware, firmware, software, or any combination thereof. Moreover, the software is preferably implemented as an application program tangibly embodied on a program storage unit or computer readable medium consisting of parts, or of certain devices and/or a combination of devices. The application program may be uploaded to, and executed by, a machine comprising any suitable architecture. Preferably, the machine is implemented on a computer platform having hardware such as one or more central processing units (“CPUs”), a memory, and input/output interfaces. The computer platform may also include an operating system and microinstruction code. The various processes and functions described herein may be either part of the microinstruction code or part of the application program, or any combination thereof, which may be executed by a CPU, whether or not such a computer or processor is explicitly shown. In addition, various other peripheral units may be connected to the computer platform such as an additional data storage unit and a printing unit. Furthermore, a non-transitory computer readable medium is any computer readable medium except for a transitory propagating signal.

All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the principles of the disclosed embodiments and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and embodiments of the disclosure, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure. 

What is claimed is:
 1. A method for generating an advertisement effectiveness score of a multimedia content element displayed in a webpage, comprising: determining an advertisement context of an advertisement; analyzing metadata associated with at least one prior advertisement to determine a prior advertisement success score; determining a multimedia context of the multimedia content element displayed in the webpage, wherein the multimedia context is determined based on at least one signature generated for the multimedia content element displayed in the webpage; matching the advertisement context to the multimedia context to determine a context matching score; and generating the advertisement effectiveness score at least based on the prior advertisement success score and the context matching score; and generating, by a signature generator system, signatures for a plurality of multimedia content elements that are clustered to determine context of the webpage utilized to search for a matching advertisement, wherein upon identification of a user gesture and determining the advertisement effective score is above a predefined threshold, the matching advertisement is generated and overlaid on the multimedia content element displayed in the webpage, wherein the user gesture is any one of: a scroll on the multimedia content element, a press on the multimedia content element, and a response to the multimedia content elements.
 2. The method of claim 1, wherein the analyzed metadata is any one of: a number of interactions of users with a prior advertisement, a duration of overlay of a prior advertisement, and a type of interaction of users interacting with a prior advertisement.
 3. The method of claim 1, wherein analyzing the metadata associated with the at least one prior advertisement to determine a prior advertisement success score further comprises: determining a plurality of prior advertisement contexts of the at least one prior advertisement; matching each prior advertisement context of the plurality of prior advertisement contexts to the advertisement context; upon determining that at least one prior advertisement matches the advertisement context above a predefined threshold, analyzing metadata associated with the at least one matching prior advertisement.
 4. The method of claim 3, further comprising: upon determining that at least one prior advertisement does not match the advertisement context above the predefined threshold, reducing the prior advertisement success score.
 5. The method of claim 1, wherein the advertisement effectiveness score is a value representing the likelihood that the advertisement will capture a particular user's attention when overlaid on the multimedia content element, wherein the value is any one of: a numerical value, a grade value, and a categorical value.
 6. The method of claim 1, wherein the multimedia content element is at least one of: an image, a graphic, a video stream, a video clip, an audio stream, an audio clip, a video frame, a photograph, and an image of signals.
 7. A non-transitory computer readable medium having stored thereon instructions for causing one or more processing units to execute the method according to claim
 1. 8. A system for generating an advertisement effectiveness score of a multimedia content element displayed in a webpage, comprising: a processing unit; and a memory, the memory containing instructions that when, executed by the processing unit, configured the system to: determine an advertisement context of an advertisement; analyze metadata associated with at least one prior advertisement to determine a prior advertisement success score; determine a multimedia context of the multimedia content element displayed in the webpage, wherein the multimedia context is determined based on at least one signature generated for the multimedia content element displayed in the webpage; match the advertisement context to the multimedia context to determine a context matching score; and generate the advertisement effectiveness score at least based on the prior advertisement success score and the context matching score; generate, by a signature generator system, signatures for a plurality of multimedia content elements that are clustered to determine context of the webpage utilized to search for a matching advertisement, wherein upon identification of a user gesture and determining the advertisement effective score is above a predefined threshold, the matching advertisement is generated and overlaid on the multimedia content element displayed in the webpage, wherein the user gesture is any one of: a scroll on the multimedia content element, a press on the multimedia content element, and a response to the multimedia content elements.
 9. The system of claim 8, wherein the analyzed metadata is any one of: a number of interactions of users with a prior advertisement, a duration of overlay of a prior advertisement, and a type of interaction of users interacting with a prior advertisement.
 10. The system of claim 8, wherein the system is further configured to: determine a plurality of prior advertisement contexts of the at least one prior advertisement; match each prior advertisement context of the plurality of prior advertisement contexts to the advertisement context; upon determining that at least one prior advertisement matches the advertisement context above a predefined threshold, analyze metadata associated with the at least one matching prior advertisement.
 11. The system of claim 10, wherein the system is further configured to: upon determining that at least one prior advertisement does not match the advertisement context above the predefined threshold, reduce the prior advertisement success score.
 12. The system of claim 8, wherein the advertisement effectiveness score is a value representing the likelihood that the advertisement will capture a particular user's attention when overlaid on the multimedia content element, wherein the value is any one of: a numerical value, a grade value, and a categorical value.
 13. The system of claim 8, wherein the multimedia content element is at least one of: an image, a graphic, a video stream, a video clip, an audio stream, an audio clip, a video frame, a photograph, and an image of signals. 